Reconstruction of Trees from Jumbled and Weighted Subtrees
نویسندگان
چکیده
Let T be an edge-labeled graph, where the labels are from a finite alphabet Σ. For a subtree U of T , the Parikh vector of U is a vector of length |Σ| which specifies the multiplicity of each label in U . We ask when T can be reconstructed from the multiset of Parikh vectors of all of its subtrees, or all of its paths, or all of its maximal paths. We consider the analogous problems for weighted trees. We show how several well-known reconstruction problems on labeled strings, weighted strings and point sets on a line can be included in this framework. We present reconstruction algorithms and non-reconstructibility results, and extend the polynomial method, previously applied to jumbled strings [Acharya et al, SIAM J on Discr. Math, 2015] and weighted strings [Bansal et al, CPM 2004], to deal with general trees and special tree classes. 1998 ACM Subject Classification F.2.2 [Nonnumerical Algorithms and Problems] Computations on discrete structures, G.2.2 [Graph Theory] Graph labeling, Trees
منابع مشابه
Weighted quartets phylogenetics.
Despite impressive technical and theoretical developments, reconstruction of phylogenetic trees for enormous quantities of molecular data is still a challenging task. A key tool in analyses of large data sets has been the construction of separate trees for subsets (e.g., quartets) of sequences, and subsequent combination of these subtrees into a single tree for the full set (i.e., supertree ana...
متن کاملPatching Up X-trees
A fundamental problem in many areas of classification, and particularly in biology, is the reconstruction of a leaf-labeled tree from just a subset of its induced subtrees. Without loss of generality, we may assume that these induced subtrees all have precisely four leaves. Of particular interest is the question of determining whether a collection of quartet subtrees uniquely defines a parent t...
متن کاملEnumeration of subtrees of trees
Let T be a weighted tree. The weight of a subtree T1 of T is defined as the product of weights of vertices and edges of T1. We obtain a linear-time algorithm to count the sum of weights of subtrees of T . As applications, we characterize the tree with the diameter at least d, which has the maximum number of subtrees, and we characterize the tree with the maximum degree at least ∆, which has the...
متن کاملNon-hereditary Minimum Deep Coalescence trees
— One of the goals of phylogenetic research is to find the species tree describing the evolutionary history of a set of species. But the trees derived from geneti data with the help of tree inference methods are gene trees that need not coincide with the species tree. This can for example happen when so-called deep coalescence events take place. It is also known that species trees can differ fr...
متن کاملThe Subtree Size Profile of Bucket Recursive Trees
Kazemi (2014) introduced a new version of bucket recursive trees as another generalization of recursive trees where buckets have variable capacities. In this paper, we get the $p$-th factorial moments of the random variable $S_{n,1}$ which counts the number of subtrees size-1 profile (leaves) and show a phase change of this random variable. These can be obtained by solving a first order partial...
متن کامل